A ScaLAPACK-based Parallelization of LSDALTON
نویسندگان
چکیده
Linear Scaling DALTON (LSDALTON) is a powerful molecular electronic structure program that is the focus of software optimization projects in PRACE 1IP-WP7.2 and PRACE 1IP-WP7.5. This part of the project focuses on the introduction of parallel diagonalization routines from the ScaLAPACK library into the latest MPI version of LSDALTON. The parallelization work has involved three main tasks: i) Redistribution of the matrices assembled for the SCF cycle from a serial / distributed state to the two dimensional block-cyclic data distribution used for PBLAS and ScaLAPACK; ii) Interfacing of LSDALTON data structures to parallel diagonalization routines in ScaLAPACK; iii) Performance testing to determine the favoured ScaLAPACK eigensolver methodology.
منابع مشابه
Porting a sphere optimization program from LAPACK to ScaLAPACK
The sphere optimization program sphopt was originally written as a sequential program using lapack, and was converted to use scalapack, primarily to overcome memory limitations. The conversion was relatively straightforward, using a small number of organizing principles which are widely applicaple to the scalapack parallelization of serial code. The main innovation is the use of a compressed bl...
متن کاملLarge scale ab initio calculations based on three levels of parallelization
We suggest and implement a parallelization scheme based on an efficient multiband eigenvalue solver, called the locally optimal block preconditioned conjugate gradient (lobpcg) method, and using an optimized three-dimensional (3D) fast Fourier transform (FFT) in the ab initio plane-wave code abinit. In addition to the standard data partitioning over processors corresponding to different k-point...
متن کاملParallel Boundary Elements Using Lapack and ScaLapack
The present work introduces the main steps towards the parallelization of existing Boundary Element Method (BEM) codes using available standard and portable libraries for writing parallel programs, such as LAPACK and ScaLAPACK. Here, a well-known BEM Fortran implementation is reviewed and rewritten to run on shared and distributed memory systems. This effort is the initial step to develop a new...
متن کاملDynamic Ordering for the Parallel One-sided Block-jacobi Svd Algorithm
The serial Jacobi algorithm (either one-sided or two-sided) for the computation of a singular value decomposition (SVD) of a general matrix has excellent numerical properties and parallelization potential, but it is considered to be the slowest method for computing the SVD. Even its parallelization with some parallel cyclic (static) ordering of subproblems does not lead to much improvement when...
متن کاملParallelization of Rich Models for Steganalysis of Digital Images using a CUDA-based Approach
There are several different methods to make an efficient strategy for steganalysis of digital images. A very powerful method in this area is rich model consisting of a large number of diverse sub-models in both spatial and transform domain that should be utilized. However, the extraction of a various types of features from an image is so time consuming in some steps, especially for training pha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012